Yet Another Summarization System with Two Modules using Empirical Knowledge
نویسندگان
چکیده
We previously proposed a summarization system, GREEN, for Japanese newspaper editorials. However, GREEN is not suitable for summarizing ordinal newspaper articles which are different from newspaper editorials. To participate in subtasks A-1 and A-2 of TSC (text Summarization Challenge) in NTCIR-2, we developed a new summarization system from scratch which copes with both ordinal articles and editorials in a Japanese newspaper. The new summarization system resulted in good evaluations: the mean value of all evaluations held the foremost place among ten systems in subtask A-1 and nine systems in subtask A-2, respectively.
منابع مشابه
Multilingual summarization system based on analyzing the discourse structure at MultiLing 2013
This paper describes the architecture of UAIC 1 ’s Summarization system participating at MultiLing – 2013. The architecture includes language independent text processing modules, but also modules that are adapted for one language or another. In our experiments, the languages under consideration are Bulgarian, German, Greek, English, and Romanian. Our method exploits the cohesion and coherence p...
متن کاملA survey on Automatic Text Summarization
Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...
متن کاملGénération de résumés par abstraction complète
This Ph.D. thesis is the result of several years of research on automatic text summarization. Three major contributions are presented in the form of published and yet to be published papers. They follow a path that moves away from extractive summarization and toward abstractive summarization. The first article describes the HexTac experiment, which was conducted to evaluate the performance of h...
متن کاملSummarization of Multimodal Information
Information Summarization is one of the key challenges for current and future information systems. In this paper, we will outline a system that comprises modules for summarizing texts and time series to study the link between the two. Summaries of texts are generated using a lexical analysis of cohesion in texts focusing on key sentences that provide cohesion: by implication, these are the sent...
متن کاملSummarization Focusing on Polarity or Opinion Fragments in Blogs
We present the TUT opinion summarization system which participated in the TAC 2008. The system consists of two modules: opinion/polarity automatic annotation module and fragment extraction module for summarization. Our research objective is to estimate the effectiveness of opinion/polarity annotation per sentence units for opinion summarization. The evaluation results showed that the polarity a...
متن کامل